Linguistically-motivated automatic classification of regional French varieties

Authors

  • Cécile Woehrling
  • Philippe Boula de Mareüil
  • Martine Adda-Decker
Abstract

The goal of this study is to automatically differentiate French varieties (standard French and French varieties spoken in the South of France, Alsace, Belgium and Switzerland) by applying a linguistically-motivated approach. We took advantage of automatic phoneme alignment to measure vowel formants, consonant (de)voicing, pronunciation variants as well as prosodic cues. These features were then used to identify French varieties by applying classification techniques. On large corpora of hundreds of speakers, over 80% correct identification scores were obtained. The confusions between varieties and the features used (by decision trees) are linguistically grounded.
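The identification scores and between-variety confusions mentioned above can be illustrated with a minimal sketch. It assumes one true label and one predicted label per speaker. The function names and the toy labels are hypothetical, invented for illustration, and are not data or code from the study.

```python
from collections import Counter


def confusion_counts(true_labels, predicted_labels):
    """Count (true variety, predicted variety) pairs, one pair per speaker."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    return Counter(zip(true_labels, predicted_labels))


def identification_rate(counts):
    """Fraction of speakers whose predicted variety matches the true one."""
    total = sum(counts.values())
    correct = sum(n for (true, pred), n in counts.items() if true == pred)
    return correct / total if total else 0.0


# Toy labels, invented for illustration only (NOT results from the study).
true_labels = ["Standard", "South", "Alsace", "Belgium", "Switzerland", "Belgium"]
predicted_labels = ["Standard", "South", "Alsace", "Switzerland", "Switzerland", "Belgium"]

counts = confusion_counts(true_labels, predicted_labels)
rate = identification_rate(counts)

# Off-diagonal entries show which varieties get confused with which.
confusions = {pair: n for pair, n in counts.items() if pair[0] != pair[1]}
print(f"identification rate: {rate:.2f}")
print("confusions:", confusions)
```

Keeping the counts keyed by (true, predicted) pairs makes it straightforward to check whether the most frequent confusions fall between varieties that are linguistically close.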

Similar resources

An evaluation of machine learning methods for prominence detection in French

The automatic detection of prosodically prominent syllables is crucial for analysing speech, especially in French where prominence contributes substantially to prosodic grouping and boundary demarcation. In this paper, we compare different machine learning techniques for the automatic detection of prominent syllables, using prosodic features (including pitch, energy, duration and spectral balan...

Speech Prosody of French Regional Varieties

This paper compares the prosody of 6 varieties of French spoken in three different areas: France (Paris and Lyon), Belgium (Tournai and Liège), and Switzerland (Geneva and Neuchâtel). The objective is to address whether some regional varieties, namely those of Geneva and Tournai, are closer to standard French (i.e. the varieties spoken in France, represented here by Paris and Lyon) than others (...

Foreign and regional accents in French. Characterisation and identification

This research focuses on the identification and characterisation of accents in French. For both foreign and regional accents, we started with perceptual identification experiments, we measured phonetic features which may characterise these accents using automatic phoneme alignment, and we ranked the most discriminating features by using classification techniques. The following features are perc...

Experiments with the ABI (Accents of the British Isles) speech corpus

The ABI (Accents of the British Isles) speech corpus contains approximately 90 hours of speech from approximately 280 speakers representing 14 different regional accents of British and Irish English. ABI includes a combination of applications-oriented and linguistically-motivated material. This paper describes experiments in which the ABI corpus is used to study the effects of these regional acc...

Regional Variations of Speech Rhythm in French: In Search of Lost Times

This paper addresses the relevance of speech rhythm acoustic measures for the description of some standard, regional and contact varieties of French. First, the limitation of conventional speech rhythm measures (e.g. %V, ΔC or PVI) for the description of French regional variations is pointed out. Then, alternative acoustic measures of speech rhythm, based on supra-segmental characteristics asso...

Publication date: 2009